added smollama base model - 1B parameter #39543

dustinwloring1988 · 2025-07-21T01:08:30Z

New base model added @ArthurZucker

smollama
-1B parameter model
-Modified llama architecture to use NoPE
-Tokoenizer from smollm3

smollama -1B parameter model - Modified llama architecture to use NoPE - Tokoenizer from smollm3

ArthurZucker

Thanks for the PR! Could you have a look into isolating the differences with Llama using modular: https://huggingface.co/docs/transformers/en/modular_transformers ! This will help a lot 🤗

TLDR write a modular_small_llama.py file that imports from llama4 or llama or other models that use NoPE, to isolate the diff then run python utils/modular_model_converter.py --files small_llama

dustinwloring1988 · 2025-07-23T01:33:47Z

@ArthurZucker yes no problem sorry did not see this until now.

added smollama

d66d3c7

smollama -1B parameter model - Modified llama architecture to use NoPE - Tokoenizer from smollm3

ArthurZucker reviewed Jul 21, 2025

View reviewed changes

ArthurZucker added the New model label Jul 21, 2025

dustinwloring1988 closed this Jul 29, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

added smollama base model - 1B parameter #39543

added smollama base model - 1B parameter #39543

Uh oh!

dustinwloring1988 commented Jul 21, 2025 •

edited

Loading

Uh oh!

ArthurZucker left a comment •

edited

Loading

Uh oh!

dustinwloring1988 commented Jul 23, 2025

Uh oh!

Uh oh!

added smollama base model - 1B parameter #39543

added smollama base model - 1B parameter #39543

Uh oh!

Conversation

dustinwloring1988 commented Jul 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ArthurZucker left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

dustinwloring1988 commented Jul 23, 2025

Uh oh!

Uh oh!

dustinwloring1988 commented Jul 21, 2025 •

edited

Loading

ArthurZucker left a comment •

edited

Loading